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Abstract 

Substances such as chemical compounds are invisible to human eyes, they 
are usually captured by sensing equipments with their spectral fingerprints. 
Though spectra of pure chemicals can be identified by visual inspection, the 
spectra of their mixtures take a variety of complicated forms. Given the knowl¬ 
edge of spectral references of the constituent chemicals, the task of data fitting 
is to retrieve their weights, and this usually can be obtained by solving a least 
squares problem. Complications occur if the basis functions (reference spec¬ 
tra) may not be used directly to best fit the data. In fact, random distortions 
(spectral variability) such as shifting, compression, and expansion have been ob¬ 
served in some source spectra when the underlying substances are mixed. In this 
paper, we formulate mathematical model for such nonlinear effects and build 
them into data fitting algorithms. If minimal knowledge of the distortions is 
available, a deterministic approach termed augmented least squares is developed 
and it fits the spectral references along with their derivatives to the mixtures. 
If the distribution of the distortions is known a prior, we consider to solve the 
problem with maximum likelihood estimators which incorporate the shifts into 
the variance matrix. The proposed methods are substantiated with numerical 
examples including data from Raman spectroscopy (RS), nuclear magnetic res¬ 
onance (NMR), and differential optical absorption spectroscopy (DOAS) and 
show satisfactory results. 


1 Introduction 

Given a (or multiple) noisy spectral observation(s) of a sample containing several 
sources (e.g., chemicals), one attempts to retrieve the weights of source signals in the 
mixtures. This can be solved by the least squares type of methods, which are the most 
widely used data analysis tools with numerous applications. Suppose the observations 
are linear mixtures of the source signals 

A" A S T A7 , with X.^ A.^j . Sij sy, - AGj riij , (1.1) 

where X G M mxp is the m x p observation matrix (mixture matrix), each row ay 
stands for a single observation of length p\ S is the basis (or source) matrix with each 
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Si being a pure source signal; A is the mixing matrix and has size n x m, and N is 
the noise matrix of the same size as X. Given S. X, the least squares solution of the 
mixing matrix is given by A — XS T (SS T )~ 1 which is the minimizer of the following 
problem 

nun||X- AS\\l . (1.2) 

If the noise is normally distributed and variances of the observations are equal, 
the above ordinary least squares (OLS) method provides minimum-variance mean- 
unbiased estimation, and is actually the maximum likelihood estimator. When the 
variances of the observations are unequal or certain correlations exist among the ob¬ 
servations, the OLS can be inefficient and lead to poor estimates on the mixing matrix. 
In some cases, one can apply a generalized least squares (GLS) model, 

A = argmin(X - AS)Q~\X - SA) T (1.3) 

with Q being the covariance matrix of the noise. The estimate of A has an explicit 
formula A = XQ 1 S T (SQ^ 1 S 1 )^ 1 In practice, one needs to estimate Q for the 
implementation of GLS since the covariance matrix is in general unknown. Hence 
an implcmentable version of GLS usually contains two steps; We first solve an OLS 
problem (or other variant least squares) to obtain an estimate of Q by the fitting 
residuals; Then we implement the GLS by (II.3p . Least squares type methods have 
been very successful with numerous applications for data fitting. However, compli¬ 
cations occur if the basis functions (reference spectra S ) may not be used directly 
to best fit the data. For example, the reference spectra may have either shifting or 
compression/expansion when the source objects are present in a mixture, these effects 
are also called spectral variability. Then the reference spectra need to be properly 
registered before fitting. 

In this paper, we are concerned with a regime where the source signals possess 
such spectral variability in addition to the noise and measurement errors. This has 
been a challenging problem in signal processing, for example, in hyperspectral (HSI) 
demixing [4], the endmembers (source signals) exhibit either mismatchs to their lab 
measured spectral signatures (for example, shifting along the wavenumber axis) or 
random distortion (some peaks are being compressed or expanded), these spectral 
variability are caused by many factors such as lighting conditions, geographic loca¬ 
tions, seasons, etc. For instance, consider a pixel (corresponding to an observation 
in X) of a mixture of soil and vegetation from a hyperspectral image, the optical 
reflectance spectra of soils from different geographic sites or under different lighting 
conditions are generally different. Similar phenomena has been observed in Raman 
spectroscopy, where spectral lines of source species can have random shifts along the 
wavenumber axis. In differential optical absorption spectroscopy (DOAS) [2], spectra 
are collected by moving a grating motor. Changes in grating position and temperature 
lead to changes in dispersion of the light beam on the detector. In nuclear magnetic 
resonance spectroscopy (NMR [12] ), laboratory NMR spectrometers usually produce 
spectra of high resolution. However, spectral variations may exist in the data as the 
instruments age or the changing environment. 

To handle the spectral variabilities, experimentalists often adjust the reference 
spectra on a lookup table by knowledge of the sensing process. In hyperspectral pro¬ 
cessing, researchers have proposed various methods to deal with the spectral variability 


for unmixing the observed pixels, the methods include deterministic approaches and 
statistical approaches (readers are referred to [1] and references therein). For example, 
authors in Em design a selection criterion to include as many as possible variety of re¬ 
flectance spectra of similar endmembers, the hyperspectral data is then unmixed with 
all possible different combinations of endmember signatures. In hyperspectral target 
detection applications, a robust matched filter was designed in |8j by allowing the 
mismatch between the target source spectrum and its reference within a predefined 
e sphere. In si. an l\ minimization based approach for robust template matching 
(data matching) in hyperspectral classification and target detection. Other statistical 
approach includes Bayesian spectral mixture analysis method [2] which uses the end- 
member signature probability distribution in the analysis for maximally capturing the 
spectral variability of an image with the least number of endmembers. Although the 
spectral variability in hyperspectral image processing has been extensively studied, 
the assumptions and methods are all seeming to be limited to specific applications 
and could be ad hoc in many cases. Less has been done for other spectral mixtures 
including Raman, NMR, and DOAS spectroscopy. In this work, we shall assume a 
database containing only one reference spectrum for each source, and the objective 
is to improve the estimates of the mixing matrix. This really set our work apart 
from these in hyperspectral unmixing. Given the fact that the random shifts is rather 
small comparing to the signal length, a natural idea is to include the derivatives of the 
templates into the fitting basis (matrix S'), we then have an augmented source matrix 
whose additional columns are the derivatives of the spectral references. When more 
information is known about the shifts such as their statistical distributions, then we 
should include this information into the approach. Under this assumption, we con¬ 
sider two scenarios both of which has random shifts in source spectral lines. In the 
first case, the random shifts from source spectra are assumed to be independent and 
identically distributed. In the second scenario, the random shifts are assume to have 
serial correlations (the shifts in mixtures from one source follow an autoregressive 
model), which cause correlations between mixtures. In the first case, we can solve 
the problem in a sequential manner, i.e., treat one observation at a time to achieve 
the mixing coefficients of the source for that particular observation. While the ob¬ 
servations can not be treated separately if the shifts are serially correlated, they are 
coupled all together. 

The paper is structured as follows; In section 2, we develop the methods and their 
mathematical details. In section 3, numerical investigations are performed to validate 
the methods proposed, in addition comparisons between the methods are provided; 
Concluding remarks and future works are given in section 4. Throughout the paper, 
the following notations will be used: A in bold face stands for matrix, a, is its ith row 
and a? its jth column. Greek letter • is a random variable. 
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Figure 1.1: spectral distortions. 

2 The Methods 

We are interested in modeling the shift and compression (or expansion) effects oc¬ 
curring in sensing, and build them into data matching algorithms. Let us consider 
modeling shift effect by writing each row of X as Xi(u), i — 1, • ■ • , m, u can be wave¬ 
length, frequency, or time depending on physical origins of the problems. It gives 
the row entries when discretized. In the model, each source signal may have different 
shifts in different mixtures. Let the shift on row Sj = Sj(u) of S be denoted by 
which means the shift of source Sj in mixture aq. The nonlinear mixing model with 
shift adjustment is: 

n 

Xj ^ ^ T £,ij) T , (2.1) 

3 =1 

and the related minimization problem is 

n 

( aij , Cij) = argmin ||®i - «,,*,(// + Cij )lb , (2.2) 

and it is non-convex. A similar problem was studied in [TU] based on alternating 
least squares estimation with Levenberg-Marquart iterative method. In the context 
of image registration, the author in JB] propose a convex approximation of (12.7j) . Both 
two methods assume that each source signal has the same shift in all mixtures. In 
practice, shifts of the same source may be different in different mixtures, and this is 
more difficult and complicated. 

2.1 Augmented Least Squares 

The idea of augmented least squares is to fit the reference spectra with their derivatives 
to the observations. Given that the shifts are small comparing with the signal length, 






(2.3) 


we shall use a truncated Taylor expansion of s 3 u + ^ l3 as an approximation, 


Sj iy t Czj) 



given the existence of the derivatives of s,;. In practice, it usually uses the first several 
terms as an approximation, in many tested cases, the first and second derivatives 
work well, more terms may be included as needed. Knowing that the each mixture 
actually is linear combination of all the sources and their derivatives, we include both 
the reference spectra and their derivatives to a form an augmented fitting basis as 


S = 


[Si| SjJ S2I ^2? 



where only the first derivatives are included. We will then solve the following aug¬ 
mented least squares problem (AgLS) 

min||X~ AS\\l , (2.4) 

A 

for the linear mixture model 

X = AS + N (2.5) 

~ ~T-T ~ 

The solution is A = XS (SS ) , here A contain weights of all the source signals 

and their derivatives. 

Besides modeling the shift effect, augmented least squares is able to solve com¬ 
pression/expansion effects occurring in source signals for data matching (plots in Fig. 
II.111 . In fact, let the compression/expansion scale on each source signal s 3 be de¬ 
noted by Vj whose value is close to 1. The nonlinear mixing model with distortion 
adjustment is: 

n 

Xi = aijSj(vjU ) + Ni , where i — 1, ■ ■ ■ , m , (2.6) 

i=i 

and the related minimization problem is 

n 

(A, v) = argmin||£Cj - } a^s j (u J u/)|| 2 . (2.7) 

3 = 1 

This minimization is non-convex. Using the same idea as the augmented least squares, 
let Vj — 1 + 5j, then 

SjiyjV) = Sj(v + 8jv) = 8j{u) + s)(z/)(5jZ/ H-. 

Then the rest follows the equations (12.411 . (12.511 . 


2.2 Augmented Maximum Likelihood Estimators 

In this augmented least squares method, no prior information such as the statistical 
distributions, means, etc., of the random shifts are assumed. Hence it is a rather 
general approach and can be applied in many situations when the knowledge of the 
random shifts are limited. If certain statistics about the shifts are known a priori, we 
should take into account the information. 




Let £ij,£ 2 t, iimi be the random shifts from the ith source signal in all the 
mixtures. We rewrite model (12. 5 p by including the source spectral and their first 
derivatives along with random shifts variables. 

X = AS+(AQS)S' + N, (2.8) 

where matrix S' contains the first derivatives of entries of S , and columns of S are 
vectors ■ • • , £mj] T G = 1, ■ ■ ■ ,n. Symbol 0 means the element-wise multipli¬ 

cation. Below we have the model in form of matrix elements, 

n n 

%ij ^ ^ dik^kj “I - ^ ^ (lik£,ikSkj T ^ f 3 ■ (2-9) 

fc=1 fc=l 

Clearly, the randomness and uncertainty of the observations include two parts: noises 
N and the random shift (A © E)S". Then the mixture matrix X has mean AS and 
follow the same distribution as (A 0 3)5' + N. 

Let G M mxp with T kyj = a ik s' kj , k — 1, ■ ■ ■ ,n. Notice that = T fc (A) depends 
on the unknown mixing matrix A, then we have 

n n 

%ij ^ ^ (^ik^kj T ^ ^ b k,ij£,ik T -©j (2.10) 

fc=l fc=l 

Below we discuss two cases according to the correlations among the mixtures. 

2.2.1 Statistical Model Case I: Heteroscedasticity 

Assume that £*& are independent and identical Gaussian A/"(0, erf), k = 1, • • • , n, while 
% r A^(0, r 2 ), and ^ lk and N l3 are independent. Clearly, we have the expectation of 

n 

Xij, E (xij) = £ ciikSkj, and the covariance between entries of X are 

k=i 


n n 

Cov((Cy, Cov( ^ ^ r k,ij£,ik < ) ^ ^ r kfij'^ik) 

k =1 k =1 

n 

= ^ ] ^k,ij^k,i'j ,(J k ' -i 7 ^ j 

k =1 
n 

E 2 / / 2 

a ik s kj s kf a k- 

k =1 

And we know Co v(xij : Xi'ji) = 0, if i ^ i' since different rows of matrix X are 
uncorrelated. However, elements within each rows are correlated, for if we rewrite the 
linear model (12. 5 p in terms of their rows (observations) 

n n 

Xi = ^ dijSj + ^ dijZijs'j + Ni , where % — 1, • • ■ , m , (2.11) 

j=i l=i 


apparently the ith row (observation) Xi is independent to the A;-th row x k for i ^ k. 





Let vec(X) G X mpxl £] ie vectorization of the mixture matrix X. Note that 
X = [xj, xj,..., with Xi being a row vector of length p, we shall stack rows of 
X to form the vec(X), that is 

vec(X) = [x u , x 12 , ■■■ , x lp , ■■■ , x ml , • • • , x mp ] T . (2.12) 


Then vec(X) ~ J\f mp (vec(AS), V ), where Af mp denotes mp— variate normal distri¬ 
bution, and 


n 


0 \ 

T Impxmp T ^ ^ ^fe^fc 



fe=l 

.0 

v m / 


where fi fc G = diag ((r*)?\(r fc )i i: , • • • , {T k )J. (T,) v , • • • , (r*)£ .(r fe ) m ^ , here 

(T/c)*,: G lZ lxp = ai k s' k .. Hence Q, k is a block diagonal matrix, and Vi is p x p matrix. 
The log-likelihood function is 

vec(X - - AS)) - - ln(27r) . 

(2.13) 

Notice that V = V(a 2 ,A) is determined by a 2 and A. If V,cr 2 , and r 2 are known, 
the maximized likelihood estimator (AgMLE) of A is the solution of m generalized 
least squares, where A i : , the AgMLE of A i; . (a row vector containing the weights of 
source signals in the ith mixture) for the ith mixture is 

Ai. = argmin ||Vi _ 1 / 2 (X i>: - A i S) T ||^ , (2.14) 


£(4-T 2 ,T 2 |X,S) = -bndet(V)-l( 


and it has a closed form 


A l , = X i ,Vr 1 S T (SVr 1 S T )- 1 , 

and Var(A i .) = (SVr 1 ^)- 1 . We propose the following iterative approach for 
AgMLE of (A, a 2 , r 2 ), 

1, Start with an initial A^ 0 ) obtained by the ordinary least squares, 

min ||X — AS ||2 

whose solution is A<°> = XS T (SS T )- 1 . Then T^ 0) whose entry is T®. = 
A ® S', • 


2, Obtain £ = (£ ik ) by solving 


(&fc) = argmin ||(X - A (0) S) ii: - ^(T fc )i, : &, 


2 

fell 2 


Then = 


fe 0) = \ &k/ m , 

\ i=1 

n 

Er*)hrOv(4 0) ) 2 . 


T° — 


^/mp 


[x - A^s)i, - ^(r fc )i i: &,*||2; v 


( 0 ) 


fe=l 







3, Update A (0) by 


— argmin ||V) 



4, Set A^ 0) and iterate step 2-3 until it converges. 

The resulting variance of the estimate is given by 



percentile of a standard normal density. 

2.2.2 Statistical Model Case II: Autoregressive Model 

In this part, we are concerned with a scenario where the mixtures (observations) are 
correlated. The correlation might due to many factors such as the dispersion of the 
chemical compounds examined which is in generally a function of time, hence shift 
from one source existing in the observation at current time will have influence on the 
shift in the mixture of future experiment. Suppose the observations are associate with 
time, meaning that we shall have a time series of mixtures. Then we assume that the 
correlation between adjacent time the random shifts are stronger than that between 
shifts over longer time. Under such condition, we propose to use autoregressive model 
AR(1) to characterize the random shifts exhibited in the mixtures. We still assume an 
iid noise N t] ~ A/"(0, r 2 ),i = 1, ■ ■ ■ ,m, j = 1, ■ ■ ■ ,p. For any source k = 1 , • • • , n, we 
assume £ lfc , £ 2 fc, ■ • • , C, m k, the shifts of source k in all mixtures, are correlated according 
to 


P,ik Pk^i—l,k T 'U'ik j |Pfc| 1 


(2.15) 


1 2 , Corr(f ifc , Ci-t,k) = 

I- pi 


with Uik ~ A/"(0, a 2 ), £o,fc = 0. This implies that Var(^) 


PI 


p\. We also assume that ^ and N l3 are independent, furthermore, (£ifc, • • • ,£ m k) T 
(£i k>, ■ ■ ■ ,£mk') for k 7 ^ k!. Following the model (12.10j) . we have 


n 


Var (xij) = Var(y^(r fc;i j^ fc ) + r 2 , 


k =1 



n 


n 


CcfV^Xij , Cov( ^ ' (J-'kyij fjiki ^ kftj’ fji'k) 


k=\ 


k=\ 


n 



k =1 










We take the vectorization of the mixture matrix X as defined in (12 .121) . then vec(X) 
J\fmp (vec(A5), V ), and 


V = t 2 I 


mpxmp 


k=1 


crt 


1 - Pk ’ 


where 


^ ^fc,ll Qk, 12 • • • ^ 


— 


(2.16) 


mpxmp 


Y ^k,m2 ■ ■ ■ ^ k,mm ) 

here flk,ij = (^'k)p.(^'k)j,:Pk ^ which is no longer a diagonal matrix. We write 

/ Vn v 12 ... v lm \ 


V = 


(2.17) 


y Knl v m2 

• • • Vmm J 


mpxmp 


\i-3\°k 


+ r 2 I pxp I(i = j), where I is the 


where V* G M pxp = ]T(r fe ) £(1^,^ , _ 

k 1 pk 

indicator function. 

The log-likelihood function takes the same form as (12.131) . We propose the follow¬ 
ing iterative algorithm for estimating the parameters, 

1, Start with an initial A^ 0 ) obtained by the ordinary least squares. 

2, Obtain £ = (f ifc ) by solving 

£ik = argmin ||(X - A (0) S) i): - ^(rjj. 0) ) i|: f iifc ||2 • 


Then we regress £ ik on £j_i <k for each k and we obtain p^ 1,fc , and 


(o) 

°k = 




2 

ik T 0 _ 


m 


y/mp 


y~' m c2 
Z—/z=2 M—1,/c 

{x - A^s) i>: - 22(r fe )i, ; 6,jfc|| 2 . 


( 0)\2 


3 . V f = V = j). Then update 

k=1 1 Pk 

by 

A (1) = argmin || V (0)_1/2 Vec(X - A (0) 5) T ||^ . 

4, Set A (0) <r- A (1) and iterate step 2-3 until it converges. 


3 Experimental Investigation 

We report here the numerical examples and results of the methods. We first present 
the results on synthesized data, and then we show how the methods work with the 
realistic data from Raman, NMR, and DOAS spectroscopy. Note that the methods re¬ 
quire certain smoothness of the data to guarantee the continuity of the derivatives,and 
this condition is satisfied approximately by the test we tested. 


















Figure 3.1: The first two rows (blue) are the source signals, and the last row is one 
noisy mixture with random shift from source signals (black). 

3.1 Synthesized examples 

In this part, we show the results of simulated examples for all the three scenarios: 
the augmented least squares (AgLS), heteroscedasticity, and autoregressive model. 
There are 2 sources, and 100 observations. The source spectra are in Fig. 13.11 The 
estimation of the parameters are presented below and in the Figs. I3.2H3.4I The 
comparisons with ordinary least squares (OLS) are also shown in the figures. It can 
be seen that the result of the AgLS are better than OLS, but are less accurate than 
the 2 other methods when the statistics of the shifts are known. However, the AgLS 
is a rather general method and computationally less complex and expensive. We also 
test the effectiveness of AgLS on linear compression/expansion for signals in Fig. 12.61 
The results are shown in Fig. 13.51 where the augmented least squares is able to deal 
with mildly linear compression or expansion. 

3.2 Applications to NMR, Raman, and DOAS spectroscopy 

1, Nuclear Magnetic Resonance (NMR) spectroscopy is a powerful and popu¬ 
lar tool for chemists and biochemists to investigate and determine the structures and 
properties of molecules. The NMR spectrum of a chemical compound is produced by 
the Fourier transformation of a time-domain signal which is a sum of sine functions 
with exponentially decaying envelopes. The real part of the spectrum can be pre¬ 
sented as the sum of symmetrical, positively valued, Lorentzian-shaped peaks. In the 
example of NMR spectroscopy, we use the true spectra of four chemical compounds, 
mannitol, /?—cyclodextrine, f3— sitosterol, and menthol (see Fig. 13.61 for their spectra). 
The coefficients of the mixing matrix A are generated from a uniform distribution in 
the range of [5,10] for each mixture. Then we obtain the mixtures by simulating the 
mixing process in m with randomly shifted source spectra and by adding Gaussian 
noise. The results of the estimations are listed in Figs. 13.7113.81 It once again shows 
that the methods proposed are better than the ordinary least squares. 











Figure 3.2: The result of the augmented least squares (AgLS), and its comparison to 
the ordinary least squares. The stand deviations of the random shifts of source signals 
are <j\ — — 1. The standard deviation of the noise is 0.05, the ground truth of the 

mixing coefficients (weights) of source 1 is one. 



Figure 3.3: Heteroscedasticity: the result of the augmented maximum likelihood esti¬ 
mator (AgMLE), and its comparison to the ordinary least squares. The stand devia¬ 
tions of the random shifts of source signals are a\ — \ and cr 2 = 1 , whose estimations 
are oq = 1.0722, a 2 — 0.7617. The standard deviation of the noise is r = 0.05 and 
its estimation is t = 0.1806, the ground truth of the mixing coefficients (weights) of 
source 1 is one. 




































Figure 3.4: Autoregressive model: the result of the augmented maximum likelihood 
estimator (AgMLE) for serial correction between mixtures, and its comparison to 
the ordinary least squares. The parameters used in AR(1) model are o\ — 1,<J2 = 
1 ,t — 0.05,pi = 0.5,p 2 = 0.4. Their estimations are by = 0.9714, d 2 = 1.0616,pi = 
0.6168, p 2 = 0.3887, f = 0.2367. The ground truth of the mixing coefficients (weights) 
of source 1 is one. 



Figure 3.5: The results of the augmented least squares (AgLS) on linear compression 
effect and comparisons to the ordinary least squares. Here the compression/expansion 
parameters Vj are in the range (0.80, 1.20), the ground truth of mixing matrix A are 
from uniform distribution of [4,5]. The absolute fitting error for the augmented least 
squares is 3.147, for the ordinary least squares is 3.632. 















































2, Raman spectroscopy: In a second realistic example, we apply our method 
to the Raman spectra. As shown in Fig. 13.91 a Raman spectrum gives a collection 
of peaks that correspond to the characteristic vibrational frequencies of the mate¬ 
rials being examine, thus providing unique information on the molecular structure 
and chemical composition of the matters [IB]. We test our method on Raman data 
provided by NSF/ATD program |T|. The sample consists of several liquid substances, 
some of them are commonly used for making explosives. The dataset include 21 mixed 
Raman spectra at different incident laser wavelengths. These are realistic data from 
Naval Research Lab and we do not have the ground truth of the mixing coefficients 
to compare to, moreover we do not have knowledge of the statistics of the random 
shifts. In this situation, it is clearly the augmented least squares is the most suitable 
method for estimation of the mixture coefficients. As a comparison we also shown 
the comparisons to the case 2 where IID Gaussian distributed shifts are assumed. 
We also compare our methods to the nonlinear least solver from di! The results of 
augmented least squares are close to those of nonlinear least squares (see Fig. I3.10p . 
although the augmented least squares is less complicated and easier to implement. On 
the other hand, the maximum likelihood results are much more deviated from other 
methods for the mixtures number 1-15 (Fig. 13. lip . This observation shows that shifts 
are not Gaussian or the existence of other nonlinear distortion such as compression 
or expansion in the signals. 

3, Differential optical absorption spectroscopy (DOAS) is based on the 
light absorption property of matter to identify broadband and narrow band spectral 
structures, and analyze atmospheric trace gases concentrations [9]. It can also help 
to understand the influence of atmospheric chemistry on climate and air quality. Fig. 
13.121 shows spectra reference of trace gases HONO and NO 2 and a DOAS spectrum 
of mixture sample containing these two gases. We are here to test the methods 
on realistic DOAS data. We are glad to find that the augmented least squares is 
able to deliver better results compare to the ordinary least squares, on the other 
hand, the maximum likelihood estimator (Heteroscedasticity) produces same results 
as the ordinary least squares, in fact, the matrix Q is actually diagonally dominant, 
and the shift effect is insignificant. However, the better results of augmented least 
squares implies that there are other spectral distortions such as linear compression 
or expansion. This once again shows that the augmented least squares are able to 
handle not only the shift but also other distortions in the spectra. 

4 Conclusion and Future Works 

In this paper, we are concerned with data matching where the source signals have 
random shifts and/or other nonlinear distortions. Novel methods are proposed for 
various scenarios according to the knowledge of the data, and they prove to work 
well with real-world data from NMR and Raman as well DOAS spectroscopy. The 
modeling of the nonlinear distortions is based on the truncated Taylor expansion of 
the signals. The computational approach is an augmented least squares which fits the 
reference spectra of the signals and their derivatives to the mixtures. If the random 
distortions are Gaussian, an augmented maximum likelihood estimator (AgMLE) is 
developed. In future work, we shall study non-Gaussian priors (such as Laplacian 









Figure 3.6: The reference spectra of mannitol, /3— cyclodextrine, (3— sitosterol, and 
menthol. 



Figure 3.7: Augmented least squares: the estimation of the weights of mannitol in 
the 100 mixtures. 






















































Figure 3.8: Augmented maximum likelihood estimator: the estimation of the weights 
of mannitol in the 100 mixtures. The parameters used in the example: a r — 2, i — 
1, ■ ■ ■ ,4,r = 0.05. The computed results of the parameters are &i = 0.9353, &2 = 
1.0264, da = 0.9229, <j 4 = 1.0766, t = 0.1940. 
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Figure 3.9: Raman spectra of some liquids used to make explosives: from left to right, 
Methanol, Acetonitrile, Ethanol, Water, and their mixture. 












































Figure 3.10: The estimation of mixing coefficients of Methanols computed by aug¬ 
mented least squares. As a comparison, we also present the results by nonlinear least 
squares. 



Figure 3.11: The estimation of mixing coefficients of Methanols computed by aug¬ 
mented maximum likelihood (AgMLE) estimator and comparison to other methods. 
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Figure 3.12: The spectral references of trace gases HONO and NO 2 , and their noisy 
mixture. 
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Figure 3.13: The computed coefficients of HONO and NO 2 by augmented least squares 
(AgLS) and comparisons with other methods. 





































Figure 3.14: The coefficients of HONO and NO 2 by augmented maximum likelihood 
estimator (AgMLE) and comparisons to other methods. 



Figure 4.1: signal with jumps and oscillations (right) is a slice (red line) of an image 
containing man-made buildings and trees. 


or hyper Laplacian) in maximum likelihood estimator to model the shifts and other 
distortions. For signals with jumps and/or high oscillations, the methods based on 
derivatives would fail to work. These signals can be found in intensity image of a 
building with trees and vegetation for image registrations (see Fig. nap. One idea is 
to model shifting by shift operator that is characterized by sparse matrix of one and 
zeros. The unknowns to be estimated are the mixing matrix and the shift matrix, 
and they are multiplied together, hence the problem is non-convex. A future work is 
to study convex approximation of the problems. 
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